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Abstract 

I Let Ps{d) be the probability that a random 0/1-matrix of size dx d is singular, and 

■ let E(d) be the expected number of 0/1-vectors in the linear subspace spanned by 

d — 1 random independent 0/1-vectors. (So E{d) is the expected number of cube 
vertices on a random affine hyperplane spanned by vertices of the cube.) 
' We prove that bounds on Ps{d) are equivalent to bounds on E{d): Ps{d) = 

O: (2-'^E{d) + ^){l + o{l)). 

^ ' We also report about computational experiments pertaining to these numbers. 

00 

cn ■ 1 Introduction 
o 

\ 0/1-polytopes arise naturally in a great variety of interesting contexts, including a promi- 
' nent role in combinatorial optimization, yet some basic characteristics of "typical" (that 

H ■ is, random) 0/1-polytopes are unknown. (For a survey of a variety of aspects of 0/1- 

>• . polytopes see [TO].) 

^ , One of the key open questions in this context is rather notorious: 

I • Pick d + 1 random vertices of the d-cvhe independently (with respect to the uniform 

distribution). What is the probability that these vectors do not form a (i-simplex? 

If we assume without loss of generality that one of these points is the origin the question 
can be rephrased: Let = [0, l]'^ be the d-dimensional unit hypercube, and let 

Md := {0,1}^^"' 

be the set of all 0/1-matrices of size d x d. 

• What is the asymptotic behaviour of the probability 

Ps{d) := Prob [det(M) = \ M e Md] 

that a random square d- dimensional 0/1-matrix is singular? 



"Supported by Deutsche Forschungs-Gemeinschaft (DFG) ZI 475/3 

"Partially supported by Deutsche Forschungs-Gemeinschaft (DFG), FZT86 and ZI 475/4 



1 



This central but difficult question has received careful attention; see Komlos [6] , BoUobas 
[2], Kahn, Komlos & Szemeredi [5]. It has been conjectured that 



Ps{d) = ^(1 + 0(1)), 

which is essentially the probability that two rows or two columns of a random matrix 
are equal. However, the known upper bounds are far off this mark; currently the best 
upper bound is Ps{d) < (1 —eY, for some rather small e > 0. (This was proved by Kahn, 
Komlos and Szemeredi in [5] with e = 0.001.) 

A closely related problem is as follows: 

• Given r random vertices Vi, . . . ,Vr oi C"^, what is the expected number of 0/1-vectors 
in the affine subspace spanned by these vectors? 

Improving a result by Odlyzko [7j, Kahn, Komlos & Szemeredi derived in [5] that there 
exists a constant C independent from d such that the probability that such an affine 
subspace contains any 0/1-vector other than Vi,...,Vr is 4Q(|)'^(1 + o(l)), provided 
that r < d — C . However, so far no results were known for the case r = d. 
In this paper we will show that determining the expected number of vertices of in the 
affine subspace spanned by d random vertices of is just as hard as determining Ps{d). 
More precisely, let Q denote the set of all linearly independent {d — l)-sets of 0/1-vectors 
of length d and for a set S of arbitrary vectors let v{S) be the number of 0/1-vectors in 
the linear subspace spanned by S. Then the following theorem holds. 



Theorem 1.1. Let 

' ' Geg 



be the expected number of 0/1 -points on the hyperplane spanned by a random linearly 
independent set of d — 1 0/1-vectors. Then 



Psid) = [^,Eid) + (1 + 0(1)). 



We can give a (trivial) lower bound for E{d) by just considering the (2) + d "fat" hyper- 
planes (faces Xi = and hyperplanes Xj — Xj = 0) containing 2'^~^ vertices each. Since 
d—1 points chosen randomly from such a hyperplane span the hyperplane with probability 
1 — (1 — eY'^ (according to [Sj) it is easy to verify that E{d) > ^(1 + o(l)). 
In fact the conjectured upper bounds on Ps{d) and E{d) are strictly equivalent: 



Corollary 1.2. As d ^ 00, 



if and only if 



Ps{d) = ^(l + o(l)) 
m = y(l+o(l)). 
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Using symmetry we could switch to an affine version, replacing Q by the set of affinely 
independent d-sets of 0/1-vectors and checking the expected value of 0/1- vectors in a 
hyperplane spanned by such a set. However, for the purpose of this paper the linear 
version will be more convenient to handle; so we will consider only hyperplanes containing 
the origin 0. 

To our knowledge the problem of determining the expected number of 0/1-vectors on a 
hyperplane h spanned by random vertices of has not been studied independently yet. 
Some basic results were derived in [2] and [5] by examining the structure of the defining 
equations a for planes h = {x & H'^ \ a^x = 0} (which is perhaps the most natural 
approach). The lemma of Littlewood-Offord (see Section [2]) a classical tool: It states that 
if all aj are nonzero then the number of 0/1-points in this plane is at most (|^^/2j)' 
coefficients satisfy additional conditions, this number can be reduced considerably (see 
Halasz [3] [1]). In order to obtain such conditions it would be of considerable interest to 
learn more about the distribution of determinants of 0/1-matrices: If o? — 1 vectors span 
a hyperplane and we write these vectors into a d x [d — 1) matrix M, then a defining 
equation a^x = is given by aj = (—1)-' det(rj(M)), where rj{M) is the matrix obtained 
from M by deleting the j-th row. 

The rest of this paper is organized as follows: In Section [2] we state some consequences 
of the Littlewood-Offord lemma. The proof of Theorem 11.11 is given in Section [3l In 
Section m we present some experimental estimates of Ps{d) for d < 30. 

Some definitions. 

We use standard vector notation a = (ai, . . . , a^)*, where d denotes the dimension. The 
expected value of a random variable X is denoted by E [X] ; the probability of an event Y 
is Prob [Y]. Define r{F) as the (linear) rank of a family or set of vectors F. 
The next definition is useful for partitioning sets of matrices into subsets with "nice" 
properties and was frequently used in the analysis of 0/1-matrices (see [2] or [5]). Given 
a d X d matrix M we define the strong rank f{M) as the largest k < d such that all 
/c-subsets of columns from M are independent. ( Equivalent ly, it is the largest k such that 
the truncation to rank k of the matroid given by the columns of the matrix m is uniform 
of rank k.) We also consider the strong rank of sets and of families of rf-dimensional 
vectors. 



2 The Littlewood-OfFord lemma 

The "Littlewood-Offord lemma" is a classical tool [2] [7j for obtaining upper bounds 
on Ps{d). 

Lemma 2.1 (Littlewood-Offord). Let s G R, n G N and let G R with \ai\ > 1 for 
1 < i < n. Then at most (^j) of the 2" sums Yl^=i — f'^^^ open interval 

(s-l,s + l). 

Corollary 2.2. Let G R, z = 1, . . . , n with at least t of the ai nonzero. Then at most 
(|t/2|)2"~* ~ '^f ^" sums X]r=i ^i'^i> ^ {0; 1} con have the same value. 
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As observed in [5], this lemma suffices to show that with very high probability the strong 
rank of a random 0/1-matrix is either close to d or at most 1. 



Lemma 2.3. Let M G Aid be a random matrix. Let E he the event that M has a 
d X {k + 1) submatrix of strong rank k for some k & {2, . . . ,d — 3j^^}. Then for large d, 



Prob [E] < 2" 



Proof. The proof follows [21 Chapter 14.2] (see also [5l Section 3.1]) and is sketched here 
for the reader's convenience. 

Let M be a random 0/1-matrix and k < d. If M contains k + 1 columns Ci, . . . , c^+i of 
strong rank k then clearly we can find a x {k + 1) submatrix of M of strong rank k by 
deleting d — k linearly dependent rows from (ci, . . . , Ck+i)- 

If we want to upper bound the probability that arbitrarily chosen columns Ci, . . . , c^+i 
have strong rank /c, then it suffices to give an upper bound on the probability that 
Ci, . . . , Cfc_|_i have rank k conditioned on the event that an arbitrary kx {k + \) submatrix 
M of (ci, . . . , Ck+i) has strong rank k: 

M has strong rank k if and only if the last column of M is a unique linear combination 
of the ffist k columns and all coefficients in this combination are non-zero. Under this 
condition the probability that any of the remaining d — k rows of Ci, . . . , c^+i satisfy 
the hnear dependency equation defined by M is at most by Lemma [2121 so the 

probability that Ci, . . . , c^+i have rank k is at most (2~'^(|^fej))'^~'^. Since there are at most 
(fc) (fe+i) ^^'"^ submatrices M we find 



We derive 



LIJ 

d\ f d 



d-k 



ln(d)J /jx/ r\ / /,\ \ d~k 



by checking that each summand in ([T]) is at most ^ if (i is large (using Stirling's formula 
and elementary, but somewhat tedious calculations). 

To complete the proof of Lemma [2.31 we observe that the event r(M) = 2 depends on the 
existence of three columns mj,mj, such that m, -|- rrij = irik, which happens with 
probability e(d^(|)'^). □ 

Corollary 2.4. Let M G Aid be a random matrix. Then 



Prob 



r(M) <d-3 



d 



\n{d) 



< 



d 



2 



t(i + o(i))- 
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3 Proof of Theorem 



1.1 



Let S C Aid be the set of singular matrices and 71 = M.d \ <S. We will partition S into 
subsets Sj <Z S,j e {1, . . . , 4} and derive precise bounds on the sizes of two of these sets 
in terms of |^| and E{d). The other two sets are small. This allows us to estimate the 



value Ps{d) = 
Let Nd:= [d 



\S\ + \TZ\ 
_ 3d 
hi(d) 



Si 

S2 
S3 

s. 



J and partition S into the disjoint sets 

: {M eMd\r{M) = d-l,f{M) = l} 
{M G Md I r(M) = d-l, f{M) > Nd} 
{MeMd\f{M)e{0,2,...,Nd}} 
{M eMd\ r{M) <d-l, f{M) = 1 or f(M) > A^^}. 



We will give precise estimates for the sizes of the sets TZ, Si, and S2, and check that the 
sets S3 and ^4 are small enough. More precisely, we will show that 

nd 

\g\d\— 



\Si\ 

\S2\ 

\Si\ 

\S3\ 

ISaI 



E{d) 



\Q\d\ 



d 



d 
1 



< 
< 

< 



\Q\d\ 

\S2 



d 

1 + 0(1)) 



(1 + 0(1)) 



^(1^.1 + 1^.1) 



(2) 

(3) 

(4) 
(5) 
(6) 

(7) 



for some constants Ci, C2 > 0. 



• While most matrices from Aid with two equal columns are in Si, most matrices 
with two equal rows lie in ^2. To see this, pick a random [d — 1) ^ d matrix 
N = [rii, . . . , Ud). Using the result of Kahn, Komlos and Szemeredi [5] that Ps{d) < 
(1 — eY for some e > 0.001, we obtain d{l — eY~'^ as an upper bound on the 
probability that at least one of the [d — 1) x [d — 1) submatrices Cj{N) is singular, 
where Cj {N) is the matrix obtained from N by deleting the j-th column rij. Cramer's 
rule gives Yl'j=ii~^ydjfij = for the determinants dj = det{cj{N)). Thus, has 
strong rank d — 1 ii all determinants are nonzero, which establishes ([5]): 

\Sl\ < |52|(1 + 0(1)) 

• By Lemma 12.31 a random matrix M ^ Md lies in ^3 with probability at most 
{d+ 1)2"'^. The probability that two columns are equal is d'^2~'^~^{l + o(l)). Again 
almost all matrices with two identical columns have strong rank d—1 and are in Si 
(up to an exponentially small subset), which implies ([6]): 

\S3\ = 0{-\Si\) 
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For each matrix M G 7^ U iSi U 1S2 there is at least one G ^ Q that is a subset of the 
column set of M. The estimates ([2]), ([3]) and (j4]) are obtained by examining this in detail: 



• For each G G ^ we have exactly ^(d — 1) matrices from Si containing only columns 
from G (since we have d — 1 choices for a duplicate column and ^ permutations). 
This gives (I3l): 

• For any fixed G G ^ we can construct d\{y{G) — d) different matrices 5" G 1S2 U 1S3 
(using columns from G and an additional nonzero column in the span of G that 
is not in G). Summing over G G ^ we obtain d\E{d){l + o(l))|^| matrices in 1S2, 
since ([5]) and (El) imply that I1S3I is small compared to I1S2I. On the other hand 
each matrix M G 1S2 is constructed r(M) + 1 times: If M G 5*2 and Ma = for 
some a ^ then |supp(a)| = r{M) + 1 (equality holds since r(M) = d — 1) and 
{mi, . . . , mfc„i, rUfc+i, . . . , rrid} is independent if and only if 7^ 0. This gives (j4]): 

= <;!^(i + o(i))ie|. 

• Similarly, we get d\{2'^ — E{d))\Q\ matrices in TZ and each matrix M G 7^ is con- 
structed d times. This gives ([2]): 

\n\ = ^{2''-E{d))\g\. 

A little more work is required for the upper bound ([7]) on I1S4I. So far we established an 
upper bound on the number of matrices of rank c? — 1 in terms of the number of regular 
matrices. A similar argument will be used to show that for any k < d — 2 there are 
significantly fewer matrices of rank k than matrices of rank k + 1, which gives the desired 
result: 

(i) First consider the matrices S with the property that the rows or the columns admit 
more than one trivial dependency (i.e. zero- vectors or pairs of identical vectors). 
This probability is dominated by the probability that a matrix has two pairs of 
identical rows or columns, which happens with probability 0((^)2~^'^), so clearly 
1^4 n 5| is exponentially smaller than -^(|5i| + \S2\)- 

(ii) Let S be the set of matrices whose columns or rows have a subset with strong rank 
in {2, . . . , Nd}. Lemma [2.31 gives that this happens with probability of at most 2""^, 
while the probability that two columns are equal is d'^2~'^~^{l + o(l)). This implies 
\S\<Oij,\Si\). 

(iii) To estimate the number of the remaining matrices in ^4, we use similar techniques 
as in [21 Chapter 14.2]: 
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We can use the Littlewood-Offord lemma to give an upper bound on the number of 
0/1- vectors in the span of a set of vectors C: Let a be in the orthogonal space of C, 
i.e. a*c = for all c G C Clearly all vectors v in the span of C satisfy a^v = 0. If 
s is the number of nonzero entries in a then Lemma 12.21 assures us that the span of 
C contains at most {\^f/2\)'^'^~^ 0/1-vectors. 

Let S4{k) be the matrices in 1S4 \ (5 U S) of rank k. For a fixed k < d — 2 and 
m G S^lk) we know that the columns and rows of m admit at most one trivial 
dependency (by excluding S) and that neither rows nor columns have a submatrix 
of strong rank between 2 and A^^ (by excluding S). Thus both ker(m) and ker(m*) 
contain vectors with more than A^^^ nonzero entries, since they are are at least 2- 
dimensional. Choose any such vectors a G ker(m) and b G ker(m*). 

If m is chosen uniformly at random from S^lk), then the probability that 7^ 
is at least ^ = 1 — If we condition on this event (that the last column of m 
is a nontrivial linear combination of the remaining columns) and consider all 0/1- 
matrices having the same first d — 1 columns as m, then (by the observation above) 
at most 2'^"^'' ( |^^^2j ) these matrices have rank k, since the last column v has to 

satisfy b^v = 0. Stirling's formula implies that ^'^'^''{^Nf/^i) ~ 2"^^^ = 0(^2^^). 

Removing the condition ad ^ changes the number of matrices only by a factor of 
1 -|- so we find that 



S,{k)\ 



This establishes ([7]): 
for some constant C2 > 



0{-^\S4{k + if k<d-2, 



0{-^JSi\ + \S2\) if k = d-2. 



\S,\ < ^{\S,\ + \S2\) 



Thus we have 

Ps{d) 









|7^| 


+ 







+ 


l'52|)(l + o(l)) 


\TZ\ 




+ 


l'52|)(l + o(l)) 



d-1 , E{d) \ 
2 ^ d I 



d-1 I E{d}_ , 2d-E{d) 
2 d d 



Yd^id) + ^) (1 + 0(1)) 
This concludes the proof of Theorem 11.11 

□ 
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4 Experiments in small dimensions 



Complete enumeration of the 0/1-matrices of size d x d is feasible up to dimension 7 (see 
[TU]), while hyperplanes were enumerated up to dimension 8 (see Aichholzer & Auren- 
hammer [1]). For some higher dimensions we generated 25,000,000 random matrices and 
determined an experimental probability Px{d) that a random matrix is singular. The 
significance of these numbers is limited for high dimensions (we found very few singular 
matrices and 25 million is tiny compared to the number of 0/1-matrices), but since the 
number of singular matrices is sharply concentrated around the expected value the results 
should still be close to the real values. Up to dimension 17 Px{d) decreases at a slower rate 
than the natural lower bound d'^2~'^ while in higher dimensions Px{d) seems to approach 
this bound. 
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Note added in proof: 

Recently, T. Tao and V. H. Vu [8] have significantly improved the upper bound on Ps{d), 
by proving that Psid) = (| + o{d)Y. 

Furthermore, M. Zivkovic has recently computed the number of singular 0/1-matrices 
of size 8x8 exactly [9]. From this we get that Ps(8) = 0.4492003726, so our estimate 
P,(8) = 0.4492346 wasn't bad. 
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